Context-Sensitive Statistics For Improved Grammatical Language Models
نویسندگان
چکیده
We develop a language model using probabilistic context-free grammars (PCFGs) that is “pseudo context-sensitive” in that the probability that a nonterminal N expands using a rule T depends on N’s parent. We give the equations for estimating the necessary probabilities using a variant of the inside-outside algorithm. We give experimental results showing that, beginning with a high-performance PCFG, one can develop a pseudo PCSG that yields significant performance gains. Analysis shows that the benefits from the context-sensitive statistics are localized, suggesting that we can use them to extend the original PCFG. Experimental results confirm that this is both feasible and the resulting grammar retains the performance gains. This implies that our scheme may be useful as a novel method for PCFG induction.
منابع مشابه
Statistical Language Modeling Using Grammatical Information
We propose to investigate the use of grammatical information to build improved statistical language models. Until recently, language models were primarily innuenced by local lexical constraints. Today, language models often utilize longer range lexical information to aid in their predictions. All of these language models ignore grammatical considerations other than those induced by the statisti...
متن کاملOn Vertical Grammatical Restrictions that Produce an Infinite Language Hierarchy
This paper introduces deriuation table.s that represent a complete grammatical derivations as whole in a vertical way. These tables are obtained by writing the consecutive sentential forms of grammatical derivations vertically one by one. The present paper places and discusses some restrictions on the columns of these tables. IVIore specifically, these restrictions constrain the order of contex...
متن کاملContext-dependent factored language models
The incorporation of grammatical information into speech recognition systems is often used to increase performance in morphologically rich languages. However, this introduces demands for sufficiently large training corpora and proper methods of using the additional information. In this paper, we present a method for building factored language models that use data obtained by morphosyntactic tag...
متن کاملThe Dual Meaning Potential of Prepositional Grammatical Metaphor in Prose Fiction
From a Systemic Functional perspective, Grammatical Metaphor (GM) as is taken to be a chief driving force in the discourse of different genres, an important adult language machinery for ideational meanings to be semantically cross-mapped and realized through a different form in the stratum of the lexico-grammar, in order to convey changed meanings and tinker with the discursive flow and develop...
متن کاملGrammar-based context-specific statistical language modelling
This paper shows how we can combine the art of grammar writing with the power of statistics by bootstrapping statistical language models (SLMs) for Dialogue Systems from grammars written using the Grammatical Framework (GF) (Ranta, 2004). Furthermore, to take into account that the probability of a user’s dialogue moves is not static during a dialogue we show how the same methodology can be used...
متن کامل